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Abstract. We compare the expressiveness of two extensions of monadic second-order 
logic (MSO) over the class of finite structures. The first, counting monadic second-order 
logic (CMSO), extends MSO with first-order modulo-counting quantifiers, allowing the 
expression of queries like "the number of elements in the structure is even". The second 
extension allows the use of an additional binary predicate, not contained in the signature of 
the queried structure, that must be interpreted as an arbitrary linear order on its universe, 
obtaining order-invariant MSO. 

While it is straightforward that every CMSO formula can be translated into an equiva- 
lent order-invariant MSO formula, the converse had not yet been settled. Courcelle showed 
that for restricted classes of structures both order-invariant MSO and CMSO are equally 
expressive, but conjectured that, in general, order-invariant MSO is stronger than CMSO. 

We affirm this conjecture by presenting a class of structures that is order-invariantly 
definable in MSO but not definable in CMSO. 



1. Introduction 



Linear orders play an important role in descriptive complexity theory since certain re- 
sults relating the expressive power of logics to complexity classes, e.g., the Immerman-Vardi 
Theorem that LFP captures Ptime, only hold for classes of linearly ordered structures. 
Usually, the order only serves to systematically access all elements of the structure, and 
consequently to encode the configurations of a step-wise advancing computation of a Tur- 
ing machine by tuples of elements of the structure. In these situations we do not actually 
want to make statements about the properties of the order, but merely want to have an 
arbitrary linear order available to express the respective coding techniques. 

Furthermore, when actually working with finite structures in an algorithmic context, 
e.g., when evaluating queries in a relational database, we are in fact working on an implicitly 
ordered structure since, although relations in a database are modelled as sets of tuples, the 
relations are nevertheless stored as ordered sequences of tuples in memory or on a disk. As 



1998 ACM Subject Classification: F.4.1 Mathematical Logic. 

Key words and phrases: MSO, Counting MSO, order-invariance, expressiveness, 
2;ame. 



Ehrenfeucht-Fraisse 



SYMPOSIUM 
ON THEORETICAL 
ASPECTS 
OF COMPUTER 
SCIENCE 



© T. Ganzow and S. Rubin 

© Creative Commons Attribution-NoDerivs License 



314 



T. GANZOW AND S. RUBIN 



this linear order is always available (though, as in the case of databases, it is implementation- 
dependent and may even change over time as tuples are inserted or deleted), we could allow 
queries to make use of an additional binary predicate that is interpreted as a linear order 
on the universe of the structure, but require the outcome of the query not to depend on the 
actual ordering, but to be order-invariant. Precisely, given a r-structure 21, we allow queries 
built over an expanded vocabulary rU{<}, and say that a query (p is order-invariant if 
(21, <i) \= (f '^=^ (21, <2) 1= (/J for all possible relations <i and <2 linearly ordering A. 

Using Ehrenfeucht-Fraisse-games for MSO, one can see that MSO on sets (i.e., struc- 
tures over an empty vocabulary) is too weak to express that the universe contains an even 
number of elements. However, this is possible if the universe is linearly ordered: simply 
use the MSO sentence stating that the maximal element should be contained in the set of 
elements on even positions in the ordering. Obviously, such a sentence is order-invariant 
since rearranging the elements does not affect its truth value. Gurevich uses this observation 
to show that the property of Boolean algebras having an even number of atoms, although 
not definable in FO, is order-invariantly definable in FO (simulating the necessary MSO- 
quantification over sets of atoms by FO-quantification over the elements of the Boolean 
algebra) . 

If we explicitly add modulo-counting to MSO, e.g., via modulo-counting first-order 
quantifiers such as "there exists an even number of elements x such that . . . " , we ob- 
tain counting monadic second-order logic (CMSO), and the question naturally arises as 
to whether there are properties not expressible in CMSO that can be expressed order- 
invariantly in MSO. 

In fact, a second separation example due to Otto gives a hint in that direction. The 
class of structures presented in [OttOOj even separates order-invariant FO from FO extended 
by arbitrary unary generalised quantifiers, i.e., especially modulo-counting quantifiers, and 
exploits the idea of "hiding" a part of the structure such that it is only meaningfully usable 
for queries in presence of a linear order (or, as actually proven in the paper, in presence of 
an arbitrary choice function). 

The expressiveness of CMSO has been studied, e.g., in |Cou90| . where it is mainly 
compared to MSO, and in [CouOS ] it is shown that, on the class of forests, order-invariant 
MSO is no more expressive than CMSO. As pointed out in |BS05| . this can be generalised 
using results in |Lap98| to classes of structures of bounded tree-width. But still, this left 
open Courcelle's conjecture: that order-invariant MSO is strictly stronger than CMSO for 
general graphs |Cou96l Conjecture 7.3]. 

In this paper, we present a suitable characterisation of CMSO-definability in terms of 
an Ehrenfeucht-Fraisse game, and later, as the main contribution, we present a separating 
example showing that a special class of graphs is indeed definable by an order-invariant 
MSO sentence but not by a counting MSO sentence. 

2. Preliminaries 

Throughout the paper N denotes the set of non- negative integers and N'*' := N — {0}. 
Given a non-empty finite set M = {mi, . . . , ruk] Cgn N+, let lcm(M) := lcm(mi, . . . , mk) 
denote the least common multiple of all elements in M; additionally, we define lcm(0) = 1. 
For sets X and Y as well as M as before, we abbreviate that \X\ = \Y\ (mod m) for all 
m G M by using the shorthand \X\ = \Y\ (mod M). 
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We restrict our attention to finite r-structures with a nonempty universe over a count- 
able relational vocabulary r, possibly with constants, and we will mainly deal with monadic 
second-order logic and some of its extensions. For more details concerning finite model the- 
ory, we refer to |EF95j or |Lib04j . 

When comparing the expressiveness of two logics £ and we say that £' is at least 
as expressive as £, denoted C C if for every E C[t] there exists a G £'[t] such that 
Mod((/9) = Mod(99'), where Mod((/?) denotes the class of all finite r-structures satisfying ip. 

2.1. Counting MSO 

The notion of (modulo-)counting monadic second-order logic (CMSO) can be intro- 
duced in two different, but nonetheless equivalent, ways. The first view of CMSO is via an 
extension of MSO by modulo-counting first-order quantifiers. 

Definition 2.1. Let r be a signature and M C N"^ a set of moduli, then 

— every formula tp € MSO[t] is also a formula in CMSO^*^^[r], and 

- if V7(x) G CMSO(*^)[r] and m G M, then 3('^)x.(^(j;) G CMSO(^^)[r]. 

If we do not restrict the set of modulo-counting quantifiers being used, we get the full 
language CMSO[t] = CMSO(^^)[ r]. The semantics of MSO formulae is as expected, and 
we have 21 \= 3^'^'^ x .ip{x) if and only if |{a G A : 21 ^ v(^)}| = (mod m). The quantifier 
rank qr('0) of a CMSO[t] formula tjj is defined as for MSO-formulae with the additional 
rule that qr (3(™)x.(^(x)) = 1 + qr((/9), i.e., we do not distinguish between different kinds of 
quantifiers. 

In this paper we use an alternative but equivalent definition of CMSO, namely the 
extension of the MSO language by monadic second-order predicates C^*"-* which hold true 
of a set X if and only if \X\ = (mod m). As in the definition above, formulae of the 
fragment CMSO^^^^ [r] may only use predicates C^™^ where m G M. The back-and-forth 
translation can be carried out along the following equivalences which increase the quantifier 
rank by at most one in each step: 

B^'^^x.pix) = 3X(C(™)(^) AVx(Xx ^ 99(x))) and 

Furthermore, the introduction of additional predicates C^^'^^ (or, equivalently, addi- 
tional modulo-counting quantifiers 3^™''")) stating for a set X that \X\ = r (mod m) does 
not increase the expressive power since they can be simulated as follows (with only a con- 
stant increase of quantifier rank): 

= 3Xo("Xo C X" A "|Xo| = r" A "C(™)(X \ Xq)") , 

where all subformulae are easily expressible in MSO. 

Later, we will introduce an Ehrenfeucht-Frai'sse game capturing the expressiveness of 
CMSO with this extended set of second-order predicates. 
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2.2. Order-invariance 

Let r be a relational vocabulary and ip € MSO[rU{<}], i.e., ip may contain an addi- 
tional relation symbol <. Then ip is called order-invariant on a class C of t- structures if, 
and only if, (21, <i) |= ip <^=^ (21, <2) |= f for all 21 G C and all linear orders <i and <2 
on A. 

Although, in general, it is undecidable whether a given MSO-formula is order-invariant 
in the finite, we will speak of the order-invariant fragment of MSO, denoted by MS0[^]j7j^;, 
that contains all formulae that are order- invariant on the class of all finite structures. 

It is an easy observation that every CMSO formula is equivalent over the class of all 
finite structures to an order- invariant MSO formula by translating counting quantifiers in 
the following way: 

3^'^^x.ip{x) := 3X3Xo . . . 3Xg_i 

/ Vx (Xx ^ fix)) A "{Xq, . . . , is a partition of X" \ 

A 3x[Xqx a yy{Xy ^ x < y)) A 3x{Xq^ix A yy{Xy ^ x > y)) 

XiX ^ X^^i (ujod g)y , 

where 5'(p,< defines the successor relation induced by an arbitrary order < on the universe 
of the structure restricted to the set X of elements for which f holds. 

Note that the quantifier rank of the translated formula is not constant but bounded by 
the parameter in the counting quantifier. 



3. An Ehrenfeucht-Praisse game for CMSO 

The Ehrenfeucht-Fraisse game capturing expressiveness of MSO parameterised by the 
quantifier-rank (cf. [EF95t ILib04j ) can be naturally extended to a game capturing the ex- 
pressiveness of CMSO parameterised by the quantifier rank and the set of moduli being 
used in the cardinality predicates or counting quantifiers. 

Viewing CMSO as MSO with additional quantifiers 3'^'^'^ x .f{x) for all m in a fixed set 
M leads to a new type of move described, e.g., in the context of extending FO by modulo- 
counting quantifiers in [NurOO] . Since a modulo-counting quantifier actually combines no- 
tions of a first-order and a monadic second-order quantifier in the sense that it makes a 
statement about the cardinality of a certain set of elements, but on the other hand, it be- 
haves like a first-order quantifier binding an element variable and making a statement about 
that particular element, the move capturing modulo-counting quantification consists of two 
phases. First, Spoiler and Duplicator select sets of elements S and D in the structures such 
that \S\ = \D\ (mod M), and in the second phase. Spoiler and Duplicator select elements a 
and b such that a € if and only if 6 G D. After the move, reflecting the first-order nature 
of the quantifier, only the two selected elements a and b are remembered and contribute to 
the next position in the game, whereas the information about the chosen sets is discarded. 

We prefer viewing CMSO via second-order cardinality predicates, yielding an Ehren- 
feucht-Frai'sse game that allows a much clearer description of winning strategies. Since 
we do not have additional quantifiers, we have exactly the same types of moves as in the 
Ehrenfeucht-Fraisse game for MSO, and we merely modify the winning condition to take 
the new predicates into account. 
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Towards this end, we first introduce a suitable concept of partial isomorphisms between 

structures. 

Definition 3.1. With any structure 2t and any set MCg^N"*" wc associate the (first- 
order) power set structure 21^ := (V{A), (C^"^'^^) m&M where the predicates C^"^'^^ are 

0<r<m 

interpreted in the obvious way. (Note that first-order predicates in the power set structure 
21^ naturally correspond to second-order predicates in 21.) 

Let 21 and *B be r-structures, and let MCg^N"'' be a fixed set of moduli. Then the 
mapping (^i, . . . , As, ai, . . . ,at) ^ (i?i, . . . , Bg, bi, . . . , bt) is called a twofold partial isomor- 
phism between 21 and 55 with respect to M if 

(i) {ai,. . . ,at) 1-^ (6i, . . . , bt) is a partial isomorphism between (21, Ai, . . . , Ag) and 
i^,Bi,...,Bg) and 

(ii) (^1, . . . , As) I— > {Bi,. . . , Bg) is a partial isomorphism between 21^ and 

Wc propose the following Ehrenfeucht-Fraisse game to capture the expressiveness of 
CMSO where the use of moduli is restricted to a (finite) set M and formulae of quantifier 
rank at most r. 

Definition 3.2 (Ehrenfeucht-Fraisse game for CMSO). Let MCfinN+ and r G N. The 
r-round (mod M) Ehrenfeucht-Fraisse game Q^{%,^) is played by Spoiler and Duplicator 
on r-structures 21 and *B. In each turn, Spoiler can choose between the following types of 
moves: 

— point move: Spoiler selects an element in one of the structures, and Duplicator 
answers by selecting an element in the other structure. 

— set move: Spoiler selects a set of elements X in one of the structures, and Duplicator 
responds by choosing a set of elements Y in the other structure. 

After r = s + t rounds, when the players have chosen sets Ai, . . . ,As and Bi, . . . ,Bs as well 
as elements ai, . . . , and bi, ... ,bt in slu arbitrary order. Duplicator wins the game if, and 
only if, (^1, . . . , As, ai, . . . ,at) f-^ {Bi, . . . , Bs, bi, . . . , bt) is a twofold partial isomorphism 
between 21 and 53 with respect to M. 

First note that, although Duplicator is required to answer a set move X hy a set Y 
such that \X\ = \Y\ (mod M) in order to win, we do not have to make this explicit in the 
rules of the moves since these cardinality constraints are already imposed by the winning 
condition {X and Y would not define a twofold partial isomorphism if they did not satisfy 
the same cardinality predicates). Furthermore, for M = 0orM = {l}, the resulting game 
Q^{^2i,^) corresponds exactly to the usual Ehrenfeucht-Fraisse game for MSO. 

Theorem 3.3. Let 21 and *B be r-structures, r G N, and M Cy^„ N. Then the following are 
equivalent: 

(i) 2t =f 03, i.e., 21 ^ i/ and only if^^^p for all ip G CMSO(^)[r] with qr((p) < r. 

(ii) Duplicator has a winning strategy in the r-round (mod M) Ehrenfeucht-Fraisse game 
G^' {%"&). m 

To prove non-definability results, we can make use of the following standard argument. 

Proposition 3.4. A class C of r-structures is not definable in CMSO if, for every r £ N 
and every M C^„N+, there are r-structures ^M,r o,nd ^M,r such that ^M,r £ C, ^M,r ^ C, 

and ^M,r ^M,r- 
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The following lemma, stating that the CMSO-theory of disjoint unions can be deduced 
from the CMSO-theories of the components, can either be proved, as carried out in [C ou90[ 
Lemma 4.5], by giving an effective translation of sentences talking about the disjoint union 
of two structures into a Boolean combination of sentences each talking about the individual 
structures, or by using a game-oriented view showing that winning strategies for Duplicator 
in the games on two pairs of structures can be combined into a winning strategy on the pair 
of disjoint unions of the structures. 

Lemma 3.5. Let 2li,2l2, 5Bi, and ^2 be t -structures such that 2li =f/ !Bi and 2t2 ^2- 
Then 2I1U2I2 =^.^ 53iU!B2. 

Proof. Consider the game on 21 := 2liU2l2 and 53 := !BiU*B2. A Spoiler's point move 
in 21 (resp., in !B) is answered by Duplicator according to her winning strategy in either 
^^■^(2li,!Bi) or a*^(2l2,!B2). A set move 5 C A (analo gous for 5 C i?) is decomposed into 
two subsets Si := S" fl Ai and 5*2 := 5 fl A2, and is answered by Duplicator by the set 
D := Di U D2 consisting of the sets Di and D2 chosen according to her winning strategies 
as responses to Si and ^2 in the respective games ^^^^(Sti, and ^^(212,582). 

Since Ai and A2 as well as Bi and B2 are disjoint, we have |5| = |S'i| + |5'2| and 
\D\ = \Di\ + \D2\. Furthermore, \Si\ = \Di\ (mod M) and |S'2| = \D2\ (mod M) as the 
sets Di and D2 are chosen according to Duplicator's winning strategies in the games on 
2ti and !Bi, and 2I2 and *B2, respectively. Since = (mod M) is a congruence relation with 
respect to addition, we have that l^l = \D\ (mod M). It is easily verified that the sets 
and elements chosen according to this strategy indeed define a twofold partial isomorphism 
between 21 and *B. ■ 

As a direct corollary we obtain the following result that will be used in the inductive 
step in the forthcoming proofs. 

Corollary 3.6. Let 2ti,2l2,5Bi, and QS2 be r-structures, such that 2li *Bi and 2I2 
532. T/ien (2liU2l2,Ai) (?BiU!B2, Si). 

Proof. We consider the following T U {P}-expansions of the given structures: 21'^ := (2li,^i), 
:= («Bi,5i), % := (2l2,0), and % := (*B2,0). It is immediate that 

(i) 2li =f !Bi implies (2li, Ai) (?Bi,5i), and 

(ii) 2I2 =f 532 implies (2l2,0) =f (532,0) 

since Duplicator can obviously win the respective Ehrenfeucht-Fraisse games on the ex- 
panded structures using the same strategies as in the games proving the equivalences on 
the left-hand side. The claim follows by applying the previous lemma to the r U {P}- 
expansions. ■ 

It is well known that MSO exhibits a certain weakness regarding the ability to specify 
cardinality constraints on sets, i.e., structures over an empty vocabulary. A proof of this 
fact using Ehrenfeucht-Fraisse games can be found in [Lib04] . By adapting this proof, we 
show that this is still the case for CMSO. 

Lemma 3.7. Let 2t and 53 be (I) -structures, MC^„N+, and r G N. Then 21 53 if 
\A\,\B\ > (2'"+! -4)lcm(M) and \A\ = \B\ (mod M). 

Proof. We prove by induction on the number of rounds that Duplicator wins the (mod M) 
r-round Ehrenfeucht-Fraisse game £/*''^(2t, 53). For r = and r = 1 the claim is obviously 
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true. Let r > 1, assume that the claim holds for r — 1, and consider the first move of the 
r-round game. We assume that Spoiler makes his move in 21 since the reasoning in the other 
case is completely symmetric. 

If Spoiler makes a set move S A, we consider the following cases: 

(1) l^l < {2'' - 4) • lcm(M) (or \A - S\ < {2' - 4) • lcm(M)). Then Duplicator selects 
a set 1) C 5 such that \D\ = \S\ (or \B - D\ = \A - S\), and hence 5 = D and 
A-S =^/_^ B - D (oi A- S ^ B - D and S =f_^ D). 

(2) l^l, 1^-51 > (2'"-4)-lcm(M). Then Duplicator selects a set L> C 5 such that \D\ = \S\ 
(mod M) and |L>|, \B -D\> {T' - 2) • lcm(M). In fact, she chooses for D half of the 
elements and chooses £ < 1cm (M) additional ones to fulfil the cardinality constraints 
\D\ = \S\ (mod M). Then, for the set B — D oi non-selected elements, we have 

\B-D\> ^ ((2''+^ - 4) lcm(M)) - ^ > (2^ - 2) lcm(M) - lcm(M) 
> (2'' - 4) lcm(M) 

for ah i satisfying < ^ < lcm(M). Since \D\ = \B - D\ + 2i, obviously \D\ > 
(2'' - 4) lcm(M) as weh. 

Thus, in both cases, by the induction hypothesis we get S =rLi D and A — S =rLi B — D. 
Hence, by Corollarv 13.61 {A, S) =^-1 (-^i -^)' i-^-i Duplicator has a winning strategy in the 
remaining (r — l)-round game from position {S,D). 

If Spoiler makes a point move s (z A, Duplicator answers by choosing an arbitrary ele- 
ment d G B. Similar to Case 1 above, we observe that {{s}, s) = {{d},d) and A — {s} 
B — {d} by the induction hypothesis. Thus, by Lemma [33} {A,s) =,r-i {B,d) implying 
that Duplicator has a winning strategy for the remaining r — 1 rounds from position (s, d). ■ 



4. The Separating Example 

We will first give a brief description of our example showing that MSO[<]m„ is strictly 
more expressive than CMSO. We consider a property of two-dimensional grids, namely 
that the vertical dimension divides the horizontal dimension. This property is easily defin- 
able in MSO for grids that are given as directed graphs with two edge relations, one for 
the horizontal edges pointing rightwards, and one for the vertical edges pointing upwards, 
by defining a new relation of diagonal edges combining one step rightwards and one step 
upwards wrapping around from the top border to the bottom border but not from the right 
to the left border. Note that there is a path following those diagonal edges starting from 
the bottom-left corner of the grid and ending in the top-right corner if, and only if, the 
vertical dimension divides the horizontal dimension of the grid. Thus, for our purposes, we 
have to weaken the structure in the sense that we hide information that remains accessible 
to MSO[<]jn„-formulae but not to CMSO formulae. 

An appropriate loss of information is achieved by replacing the two edge relations with 
their reflexive symmetric transitive closure, i.e., we consider grids as structures with two 
equivalence relations which provide a notion of rows and columns of the grid. Obviously, 
notions like corner and border vertices as well as the notion of an order on the rows and 
columns that were important for the MSO-definition of the divisibility property are lost, 
but clearly, all these notions can be regained in presence of an order. First, the order allows 
us to uniquely define an element (e.g. the <-least element) to be the bottom-left corner of 
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the grid, and second, the order induces successor relations on the set of columns and the 
set of rows, from which both horizontal and vertical successor vertices of any vertex can be 
deduced. Since the divisibility property is obviously invariant with respect to the ordering 
of the rows or columns, this allows for expressing it in MSO In the course of this 

section we will develop the arguments showing that CMSO fails to express this property on 
the following class of grid-like structures. 

Definition 4.1. A cliquey {k,£)-grid is a ~t,}-structure that is isomorphic to (5ke '■= 
({0, . . . ,k - 1} X {0,. . . ,i - l},r^h, where 

~?i ■= {{{x,y),{x',y')) : X = x'} and 

~i> ■={{{x,y),{x',y')) :y = y'}, 

i.e., ~/i consists of exactly k equivalence classes (called rows), each containing £ elements, 
and ~t, consists of exactly i equivalence classes (called columns), each containing k elements, 
such that every equivalence class of ~/i intersects every equivalence class of m exactly 
one element and vice versa. 

A horizontally coloured cliquey {k,£)-grid, denoted is the expansion of the {~t;}- 
reduct of the cliquey grid &ke by unary predicates {Pi, . . . , P^}, where the information of 
is retained in the k new predicates (in the following referred to as colours) such that each 
set Pi corresponds to exactly one former equivalence class. 

Note that the same class of grid-like structures has already been used by Otto in a proof 
showing that the number of monadic second-order quantifiers gives rise to a strict hierarchy 
over finite structures |Ott95| . 

The class is first-order definable by a sentence V'grid stating that 

— and ~/i are equivalence relations, and 

— every pair consisting of one equivalence class of ~/i and each has exactly one 
element in common 

as these properties are sufficient to enforce the desired grid-like structure. Note that even the 
second property is first-order definable since every equivalence class is uniquely determined 
by each of its elements. 

The following two lemmata justify the introduction of the notion of horizontally coloured 
cliquey grids for use in the forthcoming proofs. 

Lemma 4.2. Let ©fe^!,; ^'ki' > '^'^^ ^ki' horizontally coloured cliquey grids such 

fhnf (•5'=°' irnd (5^^°' Thpn (5^^°' =^ 

mai — r ana \o^^^ — ^ ki'^- ^k,ii+t2 —r ^k/'^+e'^' 

Proof. Note that, since there are no horizontal edges in horizontally coloured cliquey grids 
and the vertical dimension of all grids is k, is the disjoint union of the two smaller 

horizontally coloured cliquey grids 6^°' and , and of course, the same holds for (5™]/ ■ 



Thus, the claim follows by Lemma [ 
Lemma 4.3. Let (5f^ =f Then &ke =^ ^^ki'- 

Proof. For each fixed horizontal dimension k, there exists a one-dimensional quantifier- free 
interpretation of a cliquey grid in its respective horizontally coloured counterpart since we 
can define the horizontal equivalence relation ~/j in terms of the colours as follows: 

k 

X ^hV = \/ PiX /\Piy . m 
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Actually, the argument implies that Duplicator wins a game on cliquey grids using the 
same strategy that is winning in the corresponding game on coloured grids since a strategy 
preserving the colours of selected elements especially preserves the equivalence relation 

Before stating the main lemma, we will first prove a combinatorial result which will later 
help Duplicator in synthcsising her winning strategy and introduce the following weakened 
notion of equality between numbers. 

Definition 4.4. Two numbers a, 6 € N are called threshold t equal {mod M), denoted 
a =f b, if 

(i) a = b or 

(ii) a,b>t and a = b (mod M). 

Intuitively, a = f' b means that the numbers arc cqiial if they are small, or that they are at 
least congruent modulo all m G M if they are both at least as large as the threshold t. 

Lemma 4.5. For every p, t G N, and M C^^N"*", we can choose an arbitrary T > p ■ (t + 
lcm(M) — 1) such that for all sets A and B with \A\ =^ \B\ and for every equivalence 
relation on A of index at most p there exists an equivalence relation on B and a 
bisection g: A/^^^ — ^ -S/^s satisfying \{a' & A : a =t^ \9{{0'' & A : a for 

all a & A. 

Proof. We let {ai, . . . , Op/}, where p' < p denotes the index of be the set of class 
representatives of A/r^^, and we let [ajps^ := {a' & A : a' ^a o,} denote the equivalence 
class of a in A. Note that we will usually omit the subscript ^a if it is clear from the 
context and instead reserve the letters a and b for elements denoting equivalence classes in 
A and B, respectively. Furthermore, a set will be called small in the following if it contains 
less than t elements and large otherwise. 

The equivalence relation r^b on B is constructed by partitioning the set into p' disjoint 
non-empty subsets {Bi, . . . , Bp/} as follows. If \A\ = \B\, for each class [oj], we choose a 
set Bi with exactly \[ai]\ many elements. If > T, we have to distinguish between 

the treatment of small and large classes. Since |^| >T>p- (t + lcm(M) — 1), lcm(M) > 1, 
and the index of ~^ is at most p, at least one of the equivalence classes contains at least t 
elements, i.e., it is large, and without loss of generality, it is assumed that this is the case 
for [a\\. For each small class [oj], we choose a set Bi with exactly \[ai]\ many elements. 
If [cj] is large, we choose a set Bi containing t + i many elements where ^ is the smallest 
non-negative integer such that |[aj]| = \Bi\ (mod M). The number of elements selected 
according to these rules is at most p ■ {t + lcm(M) — 1) < T < \B\. Since [ai] is large by 
assumption, any possibly remaining elements in J5, that have not been assigned to one of 
the subsets . . . , Bpi yet, can be safely added to B\ without violating the condition that 
|[ai]| = \Bi\ (mod M). 

This partitioning uniquely defines the equivalence relation ^^3'-= Uf=i(-^j ^ ^i) ^■ 
By selecting an arbitrary element of each Bi we get a set of class representatives {61 , . . . , 6p' } 
which directly yields the bijection g: [ai] [bi] for all 1 < i < p' satisfying \[a] \ =^ \9{[<A)\ 
for all a G A by construction. ■ 

The following lemma extends the results on CMSO-cquivalence of large enough sets to 
large enough grids by giving a sufficient condition on the sizes of two grids for the existence 
of a winning strategy for Duplicator in an r-round (mod M) game on the two structures. 
Due to the inductive nature of the proof that involves, in each step, a construction of 
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equivalence classes as in the above lemma, we need as a criterion for the size, for fixed p G N 
and M Cg^ N+, a function /p,M : N ^ N such that, for all r G N"*" and t = fp^M{f — 1), we 
can choose T = fp^M{r) in the previous lemma. One function satisfying, for all r G N^, 
the inequality fp,M{r) > p • {fp.Mir — 1) + lcm(M) — 1) derived from the condition imposed 
on T is fpMir) = 2 • (p*" - 1) • lcm(M). 

Lemma 4.6. Let M C^„N+, r G N and k > 1 be fixed. Then for f{r) := Mi^) = 
(2fc-+i - 2) lcm(M), as given above, &kl^ =f if h h- 

Proof. As motivated by Lemma 14.31 we consider the r-round (mod M) Ehrenfeucht-Fraisse 
game on the corresponding horizontally coloured cliquey grids (S^^'^ and ©^'^^ > ^-nd we show 
by induction on the number of rounds that Duplicator has a winning strategy in this game. 

Intuitively, the proof proceeds as follows. Spoiler's set move induces an equivalence 
relation on the set of columns forming the grid he plays in, and the previous lemma implies 
that Duplicator is able to construct an equivalence relation on the columns of the other grid 
which is similar in the sense that corresponding equivalence classes satisfy certain cardinality 
constraints. Since the grids can be regarded as disjoint unions of these equivalence classes, 
we can argue by induction that corresponding subparts of the two grids, being similar 
enough, cannot be distinguished during the remaining r — 1 rounds of the game. 

The case where ii = I2 is trivial since grids of the same dimensions are isomorphic. 
Thus, we assume in the following that £1,^2 ^ and l\ = I2 (mod M). The claim is 
obviously true for r = 0, hence we assume that it holds for r — 1 and proceed with the 
inductive step. As before, we assume without loss of generality that Spoiler makes his 
moves in (5^^^ since the other case is symmetric. 

A coloured k-column is a {~^, Pi, . . . , Pfc}-structure isomorphic to G!™^ := (S^°|, such 
that a coloured grid can be regarded as a disjoint union of columns. Given a subset S of 
vertices of a grid and one of its coloured /c-columns ^ with universe C, the colour-type of 
induced by S is defined as the isomorphism type of the expansion (£, S* n C) denoted by 
tp(^, S). Given a set J- of A;-columns, each subset S of all of their vertices gives rise to an 
equivalence relation ^5 on F by virtue of Ci '^s ^2 if, and only if, tp(^i,5) = tp(^2)'S'). 
Note that the index of ~s is at most 2^^. 

Assume, Spoiler performs a set move and chooses a subset S in (S^^^^ = (^i U ■ ■ ■ U Oi • 
As described above, S induces an equivalence relation ~5 with at most 2^ equivalence 
classes on the set = {£1, . . . , Oi} of columns forming the grid. For p = 2^, t = f{r — 1) 
and M as given, by the previous lemma, there is an equivalence relation on the set = 

. . . of columns on the Duplicator's grid 0^°^^ since ^1,^2 > /('^)- Furthermore, 

there is a bijection g mapping equivalence classes of columns in one grid to the other. 

Given that the index of both ~5 and ~^ isp'<p = 2^,we can assume {G^i, . . . , Cp'} and 
{€.[, . . . , to be the sets of class representatives of and k^'^, respectively. Duplicator 
now selects the unique set D of elements such that tp{^, S) = tp(C', D) for all 1 < i < p', 
<t(£[<ti] and (t' £ giidii]). 

For each 1 < i < p', we let {(ti) := (d^j^^ \ [(ti] and (e^) := (5l°l \ [(T^] denote the 
substructures of the grids (S^'^'^ and 0^'^'^ induced by the sets of columns [Cj] and 
respectively. By construction, we have =jj^j.-i) li^'iW ^'^^ ^- Thus, depending on 

whether (and hence [£•]) are small or large with respect to the threshold /(r — 1), 
either ((^j) = {^[) or =!f_i (£•) by the induction hypothesis. Since S and D induce the 
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same colour-types on the columns in [Cj] and respectively, we have 

((e:,),5nuniv((e:,))) ^^l^ (((j::),z)nuniv((e:^))) 

for all i, where univ(-) denotes the universe of the respective structure. Thus, iterating 
Lemma 13.51 yields that Duplicator has a winning strategy in the remaining rounds of the 
game from position {S,D). 

If Spoiler makes a point move s, say in column of the grid S^"'^, Duplicator picks an 
arbitrary element d of the same colour in her grid, say in column (L'^. As the substructures 
consisting of just the columns containing the chosen elements are isomorphic, i.e., s) = 
[(t[,d), and by the induction hypothesis we have (£2 U • • • U 1 =^i^ (£'2 U • • • U , Dupli- 
cator can win the remaining (r — l)-round game from position {s,d) by Lemma 



Now we have the necessary tools available to prove the main theorem. 
Theorem 4.7. CMSO C MSO[<]i„„. 

Proof. We show that the class C := {&k£ : k\i} is not definable in CMSO but order- 
invariantly definable in MSO by the sentence V'grid A where 

Vx(min < x) A -'3z{Eh{c, z) V Ey{c, z)) \ 

y) Ty) A T min Tc) I ' 



if = 3 min 3c 
and 



Ehix, y) = X r^h y ^ 3xo3yo 



E^{x,y) = x ~„ y A 3xo3yo 



^diiigi.x,y) = {3z{E^{x,z) AEh{z,y))) 

V [-'3zEv{x, z) A 3z(z ~/i min A z ~^ x A Eh{z, y))) , 

/ xo ~h min A yo ~/i min 

A X ~„ xo A y yo A xo < yo 

y A \fzo{zo ~h min ^ < xq V > Vo) , 

/ Xo min A yo min 

A X Xo A y --ft yo A Xo < yo 

y A yzo{zo ~„ min ^ zo < xq V zq > yo) y . 

As hinted above, the horizontal and vertical edge relations {E^ and E^, respectively) are 
defined using the successor relation which is induced by an arbitrary ordering on the row 
(and column) containing the minimal element (min) which itself serves as the lower left 
corner of the grid, ^^diag defines diagonal steps through the grid that wrap around from the 
top to the bottom row. Finally, (p states that the pair consisting of the lower left corner 
(min) and the upper right corner (c) of the grid is contained in the transitive closure of 
V'diag- Obviously, there is such a sawtooth-shaped path starting at min and ending exactly 
in the upper right corner if, and only if, k\i. 

The second step consists in showing that C is not definable in CMSO. Towards this 
goal, we show that for any choice of r € N and MCgnN+, we can find k,£i,i2 € N, such 
that &kei € C, 0fc^2 C, and &kii =r^ ^ki2 which contradicts the CMSO-definability of C. 

Let r € N and MCfinN+ be fixed. We choose s > r + I such that 2"* \ lcm(M). 
Let k = 2^ h = 2'=''+ilcm(M), and = h + lcm(M). Obviously, h and £2 satisfy the 
conditions of Lemma and thus =r^ ^k£2- 
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Furthermore, £i = k ■ 2^"'^ ^"'"^ lcm(M), hence k \ ii and &kei £ C. On the other hand, 
k 'i £2 = h + lcm(M) by the choice of s, thus ©^^j ^ C. ■ 

5. Conclusion 

We have provided a characterisation of the expressiveness of CMSO in terms of an 
Ehrenfeucht-Fraisse game that naturahy extends the known game capturing MSO-defin- 
abihty, and we have presented a class of structures that are shown, using the proposed 
game characterisation, to be undefinable by a CMSO-sentence yet being definable by an 
order-invariant MSO-sentence. This establishes that order-invariant MSO is strictly more 
expressive than counting MSO in the finite. Modifying the separating example by consid- 
ering a variant of cliquey grids where the two separate equivalence relations are unified into 
a single binary relation and considering, e.g., the class of such grids where the horizontal 
dimension exactly matches the vertical dimension, we can also confirm Courcelle's original 
conjecture. 

Corollary 5.1. CMSO -definability is strictly weaker than MSO [<]inv- definability for gen- 
eral graphs. 

The separating query being essentially a transitive closure query, i.e., the only place 
where monadic second-order quantification is used is in the definition of the transitive 
closure of a binary relation, we can conclude that the same class of structures yields a 
separation of (D)TC^ [<]mt) from (D)TC"'^ (the extension of FO by a (deterministic) transitive 
closure operator on binary relations) and even from (D)TC^ extended with modulo-counting 
predicates since (D)TC C MSO. Finding separating examples concerning higher arity 
(D)TC or even full (D)TC requires further investigation since, in general, MSO C DTC^. 

Following an opposite line of research, it would be interesting to identify further classes 
of graphs, besides classes of graphs of bounded tree- width, on which MSO[<]j„^ is no more 
expressive than CMSO. 
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